Integrated Text Mining and Chemoinformatics Analysis Associates Diet to Health Benefit at Molecular Level

Authors

  • Kasper Jensen
  • Gianni Panagiotou
  • Irene Kouskoumvekaki
Abstract

Awareness that disease susceptibility depends not only on genetic makeup but can also be affected by lifestyle decisions has brought more attention to the role of diet. However, food is often treated as a black box, or the focus is limited to a few well-studied compounds, such as polyphenols, lipids and nutrients. In this work, we applied text mining and Naïve Bayes classification to assemble the knowledge space of food-phytochemical and food-disease associations, where we distinguish between disease prevention/amelioration and disease progression. We subsequently searched for frequently occurring phytochemical-disease pairs and identified 20,654 phytochemicals from 16,102 plants associated with 1,592 human disease phenotypes. We selected colon cancer as a case study and analyzed our results in three directions: (i) a one-stop legacy knowledge shop for the effect of food on disease, (ii) discovery of novel bioactive compounds with drug-like properties, and (iii) discovery of novel health benefits from foods. This work represents a systematized approach to the association of food with health effects, and provides the phytochemical layer of information for nutritional systems biology research.
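The abstract does not describe the classifier in detail, so the following is only a minimal sketch of how Naïve Bayes classification could separate sentences describing disease prevention from those describing disease progression. It assumes a multinomial model with Laplace smoothing over whitespace tokens; the training sentences, the labels `prevention` and `progression`, and the function names `train` and `predict` are hypothetical illustrations, not the authors' data or code.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy sentences for illustration only; not data from the paper.
TRAIN = [
    ("garlic intake reduces risk of colon cancer", "prevention"),
    ("broccoli extract inhibits tumor growth", "prevention"),
    ("green tea protects against colon cancer", "prevention"),
    ("red meat intake increases risk of colon cancer", "progression"),
    ("alcohol promotes tumor growth", "progression"),
    ("processed meat increases cancer risk", "progression"),
]


def train(examples):
    """Count documents per class and word occurrences per class."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        class_counts[label] += 1
        for word in text.split():
            word_counts[label][word] += 1
            vocab.add(word)
    return class_counts, word_counts, vocab


def predict(text, model):
    """Return the class with the highest log posterior (Laplace-smoothed)."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in class_counts:
        # Log prior: fraction of training sentences carrying this label.
        score = math.log(class_counts[label] / total_docs)
        # Denominator for add-one smoothing over the shared vocabulary.
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in text.split():
            if word in vocab:  # words unseen in training are skipped
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TRAIN)
```

Calling `predict("garlic reduces cancer", model)` on this toy model selects the `prevention` label, because "garlic" and "reduces" occur only in the prevention sentences. A real pipeline would need curated training text and proper tokenization.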

Related resources

A Comparative Cost-Benefit Study of an Integrated Industrial Ventilation and Humidification System versus a Baghouse Filter in a Mineral Processing Company

Introduction: Control of fugitive dust from mining processes and application of an appropriate and economical dust-collection system are essential. The goal of this study was a cost-benefit analysis of an integrated system compared with a bag filter in a crushing plant of a mining company. Methods: A local exhaust ventilation system for capture of emitted particles, a water spray for dus...

Application of Information-Theoretic Concepts in Chemoinformatics

The use of computational methodologies for chemical database mining and molecular similarity searching or structure-activity relationship analysis has become an integral part of modern chemical and pharmaceutical research. These types of computational studies fall into the chemoinformatics spectrum and usually have large-scale character. Concepts from information theory such as Shannon entropy ...

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays a key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researchers to use...

Competitive Analysis of Online Reviews Using Exploratory Text Mining

Purpose – This paper explores the usefulness of analyzing text-based online reviews using text mining tools and visual analytics for SWOT Analysis, as applied to the hotel industry. These results can be used to develop competitive actions. Design – The text mining/visualization tool, ReviewMap, was used to transform an archive of reviews spanning multiple suppliers into a hierarchy of data of i...

Journal title:

Volume 10, Issue 

Pages  -

Publication date: 2014